
    Combining Prior Knowledge and Data: Beyond the Bayesian Framework

    For many tasks, such as text categorization and control of robotic systems, state-of-the-art learning systems can produce results comparable in accuracy to those of human subjects. However, the amount of training data needed for such systems can be prohibitively large for many practical problems. A text categorization system, for example, may need to see many text postings manually tagged with their subjects before it learns to predict the subject of the next posting with high accuracy. A reinforcement learning (RL) system learning how to drive a car needs extensive experimentation with the actual car before acquiring the optimal policy. An optimizing compiler targeting a certain platform has to construct, compile, and execute many versions of the same code with different optimization parameters to determine which optimizations work best. Such extensive sampling can be time-consuming, expensive (in terms of both the human expertise needed to label data and the wear on robotic equipment used for exploration in the RL case), and sometimes dangerous (e.g., an RL agent driving the car off a cliff to see whether it survives the crash). The goal of this work is to reduce the amount of training data an agent needs in order to learn how to perform a task successfully. This is done by providing the system with prior knowledge about its domain. The knowledge is used to bias the agent towards useful solutions and limit the amount of training needed. We explore this task in three contexts: classification (determining the subject of a newsgroup posting), control (learning to perform tasks such as driving a car up the mountain in simulation), and optimization (optimizing performance of linear algebra operations on different hardware platforms). For the text categorization problem, we introduce a novel algorithm which efficiently integrates prior knowledge into large-margin classification. We show that prior knowledge simplifies the problem by reducing the size of the hypothesis space. We also provide formal convergence guarantees for our algorithm. For reinforcement learning, we introduce a novel framework for defining planning problems in terms of qualitative statements about the world (e.g., "the faster the car is going, the more likely it is to reach the top of the mountain"). We present an algorithm based on policy iteration for solving such qualitative problems and prove its convergence. We also present an alternative framework which allows the user to specify prior knowledge quantitatively in the form of a Markov Decision Process (MDP). This prior is used to focus exploration on those regions of the world in which the optimal policy is most sensitive to perturbations in transition probabilities and rewards. Finally, in the compiler optimization problem, the prior is based on an analytic model which determines good optimization parameters for a given platform. This model defines a Bayesian prior which, combined with empirical samples (obtained by measuring the performance of optimized code segments), determines the maximum-a-posteriori estimate of the optimization parameters.
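    The compiler-optimization component admits a compact worked example. In the simplest conjugate setting, a sketch rather than the thesis's actual model, a Gaussian prior centered on the analytic model's prediction combines with Gaussian-noise measurements to give a closed-form maximum-a-posteriori estimate; all numbers below are hypothetical.

```python
import numpy as np

# Hypothetical setting: the analytic hardware model predicts an optimal
# tile size of 64 (prior N(mu0, s0^2)); empirical runs yield noisy
# estimates of the best tile size (likelihood N(theta, s^2)). The
# posterior is Gaussian, so the MAP estimate is the precision-weighted
# mean of the prior and the sample evidence.
mu0, s0 = 64.0, 16.0                     # prior mean / std (analytic model)
s = 8.0                                  # assumed measurement noise std
samples = np.array([48.0, 52.0, 50.0])   # empirically best tile sizes

post_precision = 1.0 / s0**2 + len(samples) / s**2
theta_map = (mu0 / s0**2 + samples.sum() / s**2) / post_precision
print(f"MAP estimate of tile size: {theta_map:.1f}")  # pulled from 64 toward ~50
```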

    BERT for Long Documents: A Case Study of Automated ICD Coding

    Transformer models have achieved great success across many NLP problems. However, previous studies in automated ICD coding concluded that these models fail to outperform some of the earlier solutions such as CNN-based models. In this paper, we challenge this conclusion. We present a simple and scalable method to process long text with existing transformer models such as BERT. We show that this method significantly improves the previous results reported for transformer models in ICD coding, and is able to outperform one of the prominent CNN-based methods.
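    The abstract does not detail the method, but a standard way to push documents past BERT's 512-token limit is to encode overlapping chunks and pool their representations. The sketch below, using the Hugging Face transformers library with an assumed model name and max-pooling choice, illustrates that general recipe rather than the paper's exact procedure.

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Illustrative chunk-and-pool scheme: split the document into
# overlapping 512-token windows, encode each with BERT, and max-pool
# the per-chunk [CLS] vectors into a single document representation.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

def encode_long_document(text: str) -> torch.Tensor:
    enc = tokenizer(text, max_length=512, stride=128, truncation=True,
                    return_overflowing_tokens=True, padding=True,
                    return_tensors="pt")
    with torch.no_grad():
        out = model(input_ids=enc["input_ids"],
                    attention_mask=enc["attention_mask"])
    cls_per_chunk = out.last_hidden_state[:, 0]   # (n_chunks, hidden)
    return cls_per_chunk.max(dim=0).values        # pooled document vector
```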

    Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States

    Short-term probabilistic forecasts of the trajectory of the COVID-19 pandemic in the United States have served as a visible and important communication channel between the scientific modeling community and both the general public and decision-makers. Forecasting models provide specific, quantitative, and evaluable predictions that inform short-term decisions such as healthcare staffing needs, school closures, and allocation of medical supplies. Starting in April 2020, the US COVID-19 Forecast Hub (https://covid19forecasthub.org/) collected, disseminated, and synthesized tens of millions of specific predictions from more than 90 different academic, industry, and independent research groups. A multimodel ensemble forecast that combined predictions from dozens of groups every week provided the most consistently accurate probabilistic forecasts of incident deaths due to COVID-19 at the state and national level from April 2020 through October 2021. The performance of 27 individual models that submitted complete forecasts of COVID-19 deaths consistently throughout this period showed high variability in forecast skill across time, geospatial units, and forecast horizons. Two-thirds of the models evaluated showed better accuracy than a naïve baseline model. Forecast accuracy degraded as models made predictions further into the future, with probabilistic error at a 20-wk horizon three to five times larger than when predicting at a 1-wk horizon. This project underscores the role that collaboration and active coordination between governmental public-health agencies, academic modeling teams, and industry partners can play in developing modern modeling capabilities to support local, state, and federal response to outbreaks.
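    Probabilistic forecasts of this kind are typically scored with the weighted interval score (WIS), which generalizes absolute error to a set of central prediction intervals. The minimal sketch below implements the standard WIS formula with toy numbers; it is illustrative, not the project's evaluation code.

```python
def interval_score(lower, upper, y, alpha):
    """Score for a central (1 - alpha) prediction interval [lower, upper]:
    interval width plus penalties when the observation falls outside."""
    return (upper - lower) \
        + (2 / alpha) * max(lower - y, 0) \
        + (2 / alpha) * max(y - upper, 0)

def weighted_interval_score(median, intervals, y):
    """WIS over K central intervals given as {alpha: (lower, upper)},
    using the standard weights w_0 = 1/2 and w_k = alpha_k / 2."""
    k = len(intervals)
    total = 0.5 * abs(y - median)
    for alpha, (lo, hi) in intervals.items():
        total += (alpha / 2) * interval_score(lo, hi, y, alpha)
    return total / (k + 0.5)

# Toy example: a weekly-deaths forecast with 80% and 50% central intervals.
print(weighted_interval_score(
    median=100, intervals={0.2: (80, 130), 0.5: (90, 115)}, y=125))
```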

    The United States COVID-19 Forecast Hub dataset

    Academic researchers, government agencies, industry groups, and individuals have produced forecasts at an unprecedented scale during the COVID-19 pandemic. To leverage these forecasts, the United States Centers for Disease Control and Prevention (CDC) partnered with an academic research lab at the University of Massachusetts Amherst to create the US COVID-19 Forecast Hub. Launched in April 2020, the Forecast Hub is a dataset with point and probabilistic forecasts of incident cases, incident hospitalizations, incident deaths, and cumulative deaths due to COVID-19 at the county, state, and national levels in the United States. Included forecasts represent a variety of modeling approaches, data sources, and assumptions regarding the spread of COVID-19. The goal of this dataset is to establish a standardized and comparable set of short-term forecasts from modeling teams. These data can be used to develop ensemble models, communicate forecasts to the public, create visualizations, compare models, and inform policies regarding COVID-19 mitigation. These open-source data are available via download from GitHub, through an online API, and through R packages.
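    The underlying files are plain CSVs in a standardized quantile format. The snippet below sketches how one might load a single submission with pandas; the file path is hypothetical, and the column and target names follow the Hub's documented conventions.

```python
import pandas as pd

# Hypothetical path to one team's submission file, as laid out in a
# local clone of https://github.com/reichlab/covid19-forecast-hub.
df = pd.read_csv(
    "data-processed/COVIDhub-ensemble/2021-01-04-COVIDhub-ensemble.csv")

# Pull the median 1-week-ahead incident-death forecast for each location.
median_1wk = df[(df["target"] == "1 wk ahead inc death")
                & (df["type"] == "quantile")
                & (df["quantile"] == 0.5)][["location", "value"]]
print(median_1wk.head())
```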

    Combining Prior Knowledge and Data: Beyond the Bayesian Framework

    114 p. Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007. We explore this task in three contexts: classification (determining the subject of a newsgroup posting), control (learning to perform tasks such as driving a car up a mountain in simulation), and optimization (optimizing performance of linear algebra operations on different hardware platforms). For the text categorization problem, we introduce a novel algorithm which efficiently integrates prior knowledge into large-margin classification. For reinforcement learning, we introduce a novel framework for defining and solving planning problems in terms of qualitative statements about the world. In compiler optimization, a Bayesian prior based on an analytic model of the hardware is combined with empirical measurements of the performance of optimized code to determine the maximum-a-posteriori estimates of the optimization parameters.

    Rotational prior knowledge for SVMs

    Incorporation of prior knowledge into the learning process can significantly improve low-sample classification accuracy. We show how to introduce prior knowledge into linear support vector machines in the form of constraints on the rotation of the normal to the separating hyperplane. Such knowledge frequently arises naturally, e.g., as inhibitory and excitatory influences of input variables. We demonstrate that the generalization ability of rotationally constrained classifiers is improved by analyzing their VC and fat-shattering dimensions. Interestingly, the analysis shows that the large-margin classification framework justifies the use of stronger prior knowledge than the traditional VC framework. Empirical experiments with text categorization and political party affiliation prediction confirm the usefulness of rotational prior knowledge.
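    One concrete reading of such a rotational constraint is a sign restriction on individual components of the normal vector: an excitatory input forces its weight to be non-negative, an inhibitory one non-positive. The cvxpy sketch below adds such constraints to a soft-margin linear SVM; it is a plausible simplification, not the paper's exact formulation.

```python
import cvxpy as cp
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(40, 3))
# Toy labels: feature 0 acts excitatory, feature 2 acts inhibitory.
y = np.sign(X[:, 0] - X[:, 2] + 0.1 * rng.normal(size=40))

w, b = cp.Variable(3), cp.Variable()
hinge = cp.pos(1 - cp.multiply(y, X @ w + b))          # soft-margin losses
objective = cp.Minimize(0.5 * cp.sum_squares(w) + 1.0 * cp.sum(hinge))
# Rotational prior: constrain the orientation of the hyperplane's normal
# via sign constraints on its components.
constraints = [w[0] >= 0, w[2] <= 0]
cp.Problem(objective, constraints).solve()
print(w.value)
```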

    MODULAR SOFT COMPUTING APPROACH FOR AIRCRAFT CARRIER LANDING TRAJECTORY PREDICTION

    A modular learning design for classifying aircraft flight data in time-series prediction is proposed in this paper. This is part of a decision support system to assist landing signal officers in guiding aircraft to land on aircraft carriers. NeuroFuzzy systems are used to emulate the flight patterns for future real-time flight prediction. To improve learning efficiency, a two-stage modular learning design is proposed. The data to be learned are first decomposed into categories in accordance with their physical structure. Each module of data is presented to a different NeuroFuzzy system for learning purposes. Individually trained modules are modeled as genetic chromosomes, and a genetic algorithm is used to produce a chromosome that represents a generalization of all the trained modules. As compared with the non-modular approach, the modular approach offers comparable prediction performance with significantly lower overall computation time; we show that this reduction in computation time grows exponentially with problem size. Navy aircraft data were used to validate the effectiveness of the modular design, and the results are consistent and promising.
    Keywords: aircraft trajectory, genetic algorithm, modular design, neurofuzzy system.
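    The genetic combination step can be sketched abstractly: treat each module's trained parameter vector as a chromosome and evolve a consensus vector scored against all modules. Everything below (the toy fitness function, operators, and data) is an illustrative stand-in for the paper's neurofuzzy models.

```python
import numpy as np

rng = np.random.default_rng(1)

# Stand-ins for the per-module trained parameter vectors (chromosomes).
module_params = [rng.normal(loc=m, size=8) for m in (0.0, 0.5, 1.0)]

def fitness(candidate):
    """Toy surrogate for validation performance: a generalizing
    chromosome should stay close to every module's parameters."""
    return -sum(np.linalg.norm(candidate - p) for p in module_params)

population = [p.copy() for p in module_params]
for generation in range(200):
    a, b = rng.choice(len(population), size=2, replace=False)
    mask = rng.random(8) < 0.5                     # uniform crossover
    child = np.where(mask, population[a], population[b])
    child = child + rng.normal(scale=0.05, size=8)  # mutation
    worst = min(range(len(population)),
                key=lambda i: fitness(population[i]))
    if fitness(child) > fitness(population[worst]):
        population[worst] = child                  # steady-state replacement

best = max(population, key=fitness)  # generalized chromosome
```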

    Generative Prior Knowledge for Discriminative Classification

    We present a novel framework for integrating prior knowledge into discriminative classifiers. Our framework allows discriminative classifiers such as Support Vector Machines (SVMs) to utilize prior knowledge specified in the generative setting. The dual objective of fitting the data and respecting prior knowledge is formulated as a bilevel program, which is solved (approximately) via iterative application of second-order cone programming. To test our approach, we consider the problem of using WordNet (a semantic database of the English language) to improve low-sample classification accuracy of newsgroup categorization. WordNet is viewed as an approximate, but readily available, source of background knowledge, and our framework is capable of utilizing it in a flexible way.
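    The bilevel program itself is beyond a short example, but its core tension, fitting the data while respecting a generative prior, can be captured with a single second-order cone constraint that keeps the learned weights near a prior classifier. In the sketch below, w_prior is a hypothetical weight vector standing in for one derived from a generative model such as naive Bayes.

```python
import cvxpy as cp
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(30, 5))
w_true = np.array([1.0, -1.0, 0.5, 0.0, 0.0])
y = np.sign(X @ w_true + 0.1 * rng.normal(size=30))

# Hypothetical prior classifier, e.g. log-odds weights from a generative
# model fit to background knowledge rather than to (X, y).
w_prior = np.array([0.8, -0.9, 0.3, 0.1, -0.1])

w, b = cp.Variable(5), cp.Variable()
hinge = cp.pos(1 - cp.multiply(y, X @ w + b))
objective = cp.Minimize(cp.sum(hinge))
# Second-order cone constraint: stay within radius 0.5 of the prior.
constraints = [cp.norm(w - w_prior, 2) <= 0.5]
cp.Problem(objective, constraints).solve()
print(w.value)
```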

    Qualitative Reinforcement Learning

    When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no such specification is available, the agent's only recourse is a long and potentially dangerous exploration. We present a framework which allows the expert to specify imprecise knowledge of transition probabilities in terms of stochastic dominance constraints. Our algorithm can be used to find optimal policies for qualitatively specified problems, or, when no such solution is available, to decrease the required amount of exploration. The algorithm's behavior is demonstrated on simulations of two classic problems: mountain car ascent and cart pole balancing.
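    The stochastic dominance constraints can be made concrete with a small checker. A qualitative statement such as "the faster the car is going, the more likely it is to reach the top" orders two transition distributions: over outcomes ranked from worst to best, the faster state's cumulative distribution must never exceed the slower state's. The toy distributions below are assumptions for illustration.

```python
import numpy as np

def first_order_dominates(p, q):
    """True if distribution p first-order stochastically dominates q,
    given outcome probabilities listed from worst to best outcome:
    p's CDF must never exceed q's."""
    return np.all(np.cumsum(p) <= np.cumsum(q) + 1e-12)

# Outcomes ordered worst -> best: [roll back, stall, reach the top].
p_fast = np.array([0.1, 0.2, 0.7])   # transition dist. at high speed
p_slow = np.array([0.3, 0.4, 0.3])   # transition dist. at low speed

# Encodes the qualitative statement "faster is more likely to succeed".
print(first_order_dominates(p_fast, p_slow))  # True
```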